Type subsumption cache: handle unsolved type vars #19040

majocha · 2025-10-30T10:49:13Z

Description

Fixes #19037

Added repro test case.

The problem was in memoization of TType -> TypeStructure generation.
TType is inherently mutable. One TType instance can result in different TypeFeasiblySubsumesType outcomes and different TypeStructure key fragments, as type arguments get solved. For the purpose of type subsumption caching, we can treat as stable only fully solved types, with solved or rigid type vars. We still want the cache to use the unstable ones, but we must produce new TypeStructures from the TTypes as they evolve towards a solution.

Additionally, caching is now limited only to TType_app cases.

Note:

Cache keys do not take into account constraints. This seems to work in practice, because TypeFeasiblySubsumesType, where the cache is applied, does not use them either.

If this turns out to be a problem, or we ever need to use TypeStructures also somewhere else, we should emit a structural fingerprint of Typar.Constraints, too.

github-actions · 2025-10-30T10:49:52Z

❗ Release notes required

✅ Found changes and release notes in following paths:

Change path Release notes path Description

src/Compiler docs/release-notes/.FSharp.Compiler.Service/11.0.0.md

majocha · 2025-10-30T13:10:33Z

Yet another bug. This needs a careful look.

I need to run some benchmarks again, too and compare standalone builds with the original Vlad's implementation.

T-Gro · 2025-11-03T11:42:38Z

@majocha:
If we separated the solved and unsolved form of a typar, why wouldn't we want to cache (type subsumption especially) for both?
.e.g. constraints might be already there and the cache would hold info on type subsumption of generic typars.

It would equally help if two typars are both unsolved, but strip down to the same form.

Wasn't the mistake more the fact that solved/unsolved form of the same TType_var had always the same key (based on Stamp) ?

This is definitely a safe fix, but I wonder if it doesn't too much limit the potential of the cache for typars.

majocha · 2025-11-03T12:14:58Z

Wasn't the mistake more the fact that solved/unsolved form of the same TType_var had always the same key (based on Stamp) ?

Yes, that makes sense. It makes me wonder if we could also handle unsolved nullness better.

T-Gro · 2025-11-03T14:14:04Z

Nullness is not stamped, so in the case of unresolved/not yet resolved/ nullness, there is no identifier to hold onto.
The identity of the NullnessVar (possibly the leaf one if multiple are sequentially linked) object itself (via https://learn.microsoft.com/en-us/dotnet/api/system.runtime.compilerservices.runtimehelpers.gethashcode?view=net-9.0 )

majocha · 2025-11-03T16:40:46Z

This fix is accidental, the real problem is here:

fsharp/src/Compiler/Utilities/TypeHashing.fs

Line 480 in 9670f60

Extras.WeakMap.getOrCreate (fun ty -> accumulateTType ty |> toTypeStructure)

This was intended to speed up things by weakly attaching the computed TypeStructures to their respective TTypes. We cannot do it for not fully solved types, they still mutate and things get outdated.

majocha · 2025-11-03T18:23:00Z

Wasn't the mistake more the fact that solved/unsolved form of the same TType_var had always the same key (based on Stamp) ?

Yes, in fact in case of typars we don't want the stamp as identity at all. We should use the structure of the solution (TType) if available or a common token for unsolved.

src/Compiler/Utilities/TypeHashing.fs

Co-authored-by: Brian Rourke Boll <[email protected]>

majocha · 2025-11-04T10:03:05Z

I'm testing this in the IDE with the notorious OpenTK 5.0. It improved things significantly, there are way less cache entries now but much more hits, resulting in 99% ratio with little memory use and no constant churn during edits:

|             Cache name              | hit-ratio | adds | updates |  hits   | misses | 
|-------------------------------------|-----------|------|---------|---------|--------|
| typeSubsumptionCache                |    99.94% | 2752 |       0 | 4618992 |   2752 |

src/Compiler/Utilities/TypeHashing.fs

T-Gro · 2025-11-04T10:31:41Z

src/Compiler/Utilities/TypeHashing.fs

+    let rec private accumulateTypar (typar: Typar) =
+        seq {
+            match typar.Solution with
+            | Some ty -> yield! accumulateTType ty


In general, the input going into the cache is already stripped, right?

i.e. we should not be getting long chains of solution pointers for something which is solved.

Yeah since this is operating solely on stripped types, if we encounter a type var, it should never be a solved one. In theory.

majocha · 2025-11-05T09:50:25Z

The huge mistake that causes a lot of inefficiency was emitting typar stamps as part of the cache key. This causes a lot of equivalent but really unusable keys polluting the cache. The increased churn is especially heavy on the IDE. I cannot see a solution to this, that would guarantee soundness.

T-Gro · 2025-11-05T14:55:47Z

The huge mistake that causes a lot of inefficiency was emitting typar stamps as part of the cache key. This causes a lot of equivalent but really unusable keys polluting the cache. The increased churn is especially heavy on the IDE. I cannot see a solution to this, that would guarantee soundness.

What about skipping cache for them?
For unsolved typars, the typefeasiblysubsubsumestypes is a quick decision in case of TType_var anyway.

Of course it somewhat limits the genericity of the cache, and is special-tailored to the place of application.

The typesubsumption was really needed for concrete types, with super types and interface hierarchies.
Maybe if Ttype_var is not part of it (after stripping), we can have both correctness and good cache contents?

I was thinking of content-based hashing, but you would need to hash all type constraints content-wise as they are mutable as well. Maybe the inherent mutation for solutions and constraints of a type var are good enough reasons to treat them differently?
(mutation is not well suited for cache keys)

majocha · 2025-11-05T23:11:26Z

So, basically, we care only about caching TType_app _ pairs.
But most of them have type vars. Often still unsolved when they end up in the checkSubsumes call, just like in the test here.
I tried to not cache the types with unsolved vars but the results are abysmal, looking at OpenTK 5.0 project as baseline build times go from 10 sec to forever.

I wonder if the constraints are really a problem wrt to type subsumption cache. Because the current situation where we emit only a stamp as typar identity worked so far ( the bug this PR tries to fix is elsewhere). A given typar can very well have its constraints mutated while the stamp stays constant, so it is already not collision free in general. It just does not collide in practice for this usage. I think it was like this since the original Vlad's implementation.

Copilot when asked says:

Q: which typar constraints can influence the result of TypeFeasiblySubsumesType? I wonder because I need to know what should take part in keying the TypeSubsumptionCache.

A: Short answer: none of the typar constraints affect TypeFeasiblySubsumesType.
Details:
• In TypeFeasiblySubsumesType, if either side is a type variable (TType_var), the function immediately returns true and does not consult the variable’s constraints.
• For nominal types (TType_app) with the same tycon, it delegates to TypesFeasiblyEquiv for each type argument. TypesFeasiblyEquiv also returns true when it encounters a TType_var (and ignores its constraints), and treats measure types as feasibly equivalent without comparing the actual measures.
• For tuples/anon/funs, it uses TypesFeasiblyEquiv similarly, so constraints are again ignored.
• The remaining branches follow nominal inheritance and interface chains of the concrete right-hand type. This depends on the type definitions/instantiations (and ultimately on metadata reachable via g/amap), not on typar constraint sets.
Implications for the cache key:
• You do not need to include any typar constraint environment in the TypeSubsumptionCache key.
• The existing key of stripped nominal type structures (when both sides are TType_app) plus CanCoerce is correct. This aligns with how the algorithm makes decisions.
• The cache is already partitioned per TcGlobals via the weak map, so differences in loaded metadata are naturally isolated.

hmm.

src/Compiler/Utilities/TypeHashing.fs

T-Gro · 2025-11-06T09:45:23Z

src/Compiler/Utilities/TypeHashing.fs

+
+        let tokens =
+            emitTType env ty
+            |> Seq.filter (fun t -> t <> TypeToken.Nullness NullnessInfo.WithoutNull)


You might as well solve it at emission time (i.e. not emit a nullness token at all into the sequence), or?

T-Gro · 2025-11-06T09:53:24Z

src/Compiler/Utilities/TypeHashing.fs

+
+                match r.Solution with
+                | Some ty -> yield! emitTType env ty
+                | None ->
+                    if r.Rigidity = TyparRigidity.Rigid then
+                        TypeToken.Rigid typarId
+                    else
+                        env.stable <- false
+                        TypeToken.Unsolved typarId


A basic checklist, please check each:

Infinity chains of typars are handled (by laziness and truncation)

Solve typar's hash is equal to its solution's hash (after nullness token filtering)

Solved typar's hash is not equal to unsolved typar's hash with the same stamp

Unsolved typar's are never weakly cached on the reference itself (solved by boolean flag to reference caching)

This is never called in parallel for the same input ttype (since the context access it not thread-safe)

The cache is kept small by caching only top-level Ttype_app and their contents are normalized and stamp-independent

The weak cache still makes sense, it just is not used whenever an unsolved typar is encountered. But for all other scenarios, it can avoid the getTypeStructureOfStrippedType call.

A basic checklist, please check each:

Infinity chains of typars are handled (by laziness and truncation)

Solved typar's hash is equal to its solution's hash (after nullness token filtering)

Solved typar's hash is not equal to unsolved typar's hash with the same stamp

Unsolved typar's are never weakly cached on the reference itself (solved by boolean flag to reference caching)

This is never called in parallel for the same input ttype (since the context access it not thread-safe)

The cache is kept small by caching only top-level Ttype_app and their contents are normalized and stamp-independent

The weak cache still makes sense, it just is not used whenever an unsolved typar is encountered. But for all other scenarios, it can avoid the getTypeStructureOfStrippedType call.

This one I am not entirely sure. I'm trying to informally test by building FCS net10.0 and also OpenTK 5.0, both of which have quite different profiles of cache use, OpenTK now does not benefit from weak memoization. I left it in with the editor use in mind, but now that the cache works only on TType_apps, I wonder if it still helps.

T-Gro · 2025-11-06T10:02:03Z

src/Compiler/Utilities/TypeHashing.fs

-    let inline toNullnessToken (n: Nullness) =
+    let private emitNullness env (n: Nullness) =
        match n.TryEvaluate() with
        | ValueSome k -> TypeToken.Nullness k


The trace based mutation+rollback might be a problem for the weakcache, see e.g. here

fsharp/src/Compiler/Checking/ConstraintSolver.fs

Lines 1045 to 1050 in 3828c87

| Nullness.Variable nv1, _ ->

trace.Exec (fun () -> nv1.Set nullness2) (fun () -> nv1.Unset())

CompleteD

| _, Nullness.Variable nv2 ->

trace.Exec (fun () -> nv2.Set nullness1) (fun () -> nv2.Unset())

CompleteD

.

Or another one here:

fsharp/src/Compiler/Checking/ConstraintSolver.fs

Line 961 in 3828c87

trace.Exec (fun () -> r.typar_solution <- Some ty) (fun () -> r.typar_solution <- None)

The typestructure has this covered, but weakcache has not.
But fully clearing the weakcache when a trace's undo is called is way too defensive and would likely sacrifice a lot of perf potential :-(

I think the weakcache is not as crucial now, when we limit only to TType_app. Is nullness taken into account for TypeFeasiblySubsumesType? Maybe it's possible to not emit it at all.
(In the long run type structure generation could be made more configurable if needed, too.)

Ah, this happens not just for nullness.

Yes, happens for typar solutions - e.g. method overloading, also in context of picking SRTP overloads.

If we are in a Trace environment within constraint solving, it would be safest to never add things to the WeakCache.

(alternatively add it there, but then remove at Undo - but there might be overhead in keeping track what to remove, I guess safer not to add...)

Yes, as first approximation we can just not memoize when any typars come into play, solved or not. Good thing the performance is still way better than before.

src/Compiler/Utilities/TypeHashing.fs

majocha · 2025-11-06T11:31:24Z

I quickly asked copilot:

Q: Does TypeFeasiblySubsumesType take nullness into account ?

A:

Short answer: No. The predicate is intentionally nullness-agnostic.
Why:
• In the core match it compares constructed types as TType_app (tc1, l1, _), TType_app (tc2, l2, ) and discards the third field (), which is where the nullness marker lives. Only tycon equality and argument feasibility are checked via TypesFeasiblyEquiv, which likewise ignores nullness.
• The special-case for object uses isObjTyAnyNullness g ty1, i.e., any nullness of obj is accepted as a supertype of any reference type, again ignoring nullness compatibility.
• Supertype/interface traversal (GetSuperTypeOfType, GetImmediateInterfacesOfType) calls back into TypeFeasiblySubsumesType without reconciling nullness.
Related:
• FindUniqueFeasibleSupertype preserves the nullness of ty2 in the result (addNullnessToTy n2 t), but the feasibility test itself still uses TypeFeasiblySubsumesType and thus ignores nullness.
• ChooseTyparSolution may add nullness due to constraints, but that’s orthogonal to the subsumption check.
If you need a nullness-aware check, compare nullnessOfTy g ty1 and nullnessOfTy g ty2 explicitly alongside calling TypeFeasiblySubsumesType.

github-project-automation bot added this to F# Compiler and Tooling Oct 30, 2025

github-project-automation bot moved this to New in F# Compiler and Tooling Oct 30, 2025

majocha marked this pull request as ready for review October 30, 2025 13:10

majocha requested a review from a team as a code owner October 30, 2025 13:10

majocha marked this pull request as draft November 3, 2025 17:13

majocha added 3 commits November 3, 2025 18:42

add test

e3d1871

release notes

aec47e7

for unsolved always recompute TypeStructure

b4b951c

majocha force-pushed the fix-unsolved branch from 075d37a to b4b951c Compare November 3, 2025 17:50

majocha marked this pull request as ready for review November 3, 2025 18:12

brianrourkeboll reviewed Nov 3, 2025

View reviewed changes

src/Compiler/Utilities/TypeHashing.fs Outdated Show resolved Hide resolved

nit

0939fba

Co-authored-by: Brian Rourke Boll <[email protected]>

majocha changed the title ~~Do not cache unsolved typars~~ Type subsumption cache: handle unsolved type vars Nov 4, 2025

T-Gro reviewed Nov 4, 2025

View reviewed changes

src/Compiler/Utilities/TypeHashing.fs Show resolved Hide resolved

T-Gro reviewed Nov 4, 2025

View reviewed changes

majocha marked this pull request as draft November 4, 2025 12:46

majocha added 3 commits November 4, 2025 16:20

Merge branch 'main' into fix-unsolved

ad681fa

wip

5f86502

emit stamps for constrained vars only

068b3b5

majocha added 2 commits November 6, 2025 01:15

TType_app only

25357f8

merge main

73c80ff

majocha force-pushed the fix-unsolved branch from b48fc74 to 73c80ff Compare November 6, 2025 00:26

memoize only stable structures

e22f124

majocha force-pushed the fix-unsolved branch from e62dfcf to e22f124 Compare November 6, 2025 09:16

T-Gro reviewed Nov 6, 2025

View reviewed changes

src/Compiler/Utilities/TypeHashing.fs Show resolved Hide resolved

majocha marked this pull request as ready for review November 6, 2025 09:42

T-Gro reviewed Nov 6, 2025

View reviewed changes

Merge branch 'main' into fix-unsolved

5065dc7

majocha commented Nov 6, 2025

View reviewed changes

src/Compiler/Utilities/TypeHashing.fs Outdated Show resolved Hide resolved

omfg

105e9aa

majocha added 2 commits November 6, 2025 14:29

do not weakly memoize when typars are involved, ignore nullness

cb423e5

merge main

9c5037a

majocha marked this pull request as draft November 6, 2025 14:38

Merge branch 'main' into fix-unsolved

95711be

majocha marked this pull request as ready for review November 6, 2025 16:20

	\| Nullness.Variable nv1, _ ->
	trace.Exec (fun () -> nv1.Set nullness2) (fun () -> nv1.Unset())
	CompleteD
	\| _, Nullness.Variable nv2 ->
	trace.Exec (fun () -> nv2.Set nullness1) (fun () -> nv2.Unset())
	CompleteD

Type subsumption cache: handle unsolved type vars #19040

Are you sure you want to change the base?

Type subsumption cache: handle unsolved type vars #19040

Conversation

majocha commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Uh oh!

github-actions bot commented Oct 30, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

❗ Release notes required

Uh oh!

majocha commented Oct 30, 2025

Uh oh!

T-Gro commented Nov 3, 2025

Uh oh!

majocha commented Nov 3, 2025

Uh oh!

T-Gro commented Nov 3, 2025

Uh oh!

majocha commented Nov 3, 2025

Uh oh!

majocha commented Nov 3, 2025

Uh oh!

Uh oh!

majocha commented Nov 4, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

majocha Nov 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

majocha commented Nov 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

T-Gro commented Nov 5, 2025

Uh oh!

majocha commented Nov 5, 2025

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

majocha Nov 6, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

majocha commented Nov 6, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

majocha commented Oct 30, 2025 •

edited

Loading

github-actions bot commented Oct 30, 2025 •

edited

Loading

majocha Nov 4, 2025 •

edited

Loading

majocha commented Nov 5, 2025 •

edited

Loading

majocha Nov 6, 2025 •

edited

Loading